Persistence stability for geometric complexes 



Frederic Chazalf Vin de Silva} Steve Oudot* 



July 16, 2012 



Abstract 



In this paper we study the properties of the homology of different geometric filtered complexes 
(such as Vietoris-Rips, Cech and witness complexes) built on top of precompact spaces. Using 
recent developments in the theory of topological persistence [6] we provide simple and natural 
proofs of the stability of the persistent homology of such complexes with respect to the Gromov- 
Hausdorff distance. We also exhibit a few noteworthy properties of the homology of the Rips 
and Cech complexes built on top of compact spaces. 



1 Introduction 

Inferring topological properties of metric spaces from approximations is a problem that has attracted 
special attention in computational topology recently. Given a metric space (Y, dy) approximating 
an unknown metric space (X, dx), the aim is to build a simplicial complex on the vertex set Y 
whose homology or homotopy type is the same as X. Note that, although Y is finite in many 
applications, finiteness is not a requirement a priori. 

Among the many geometric simplicial complexes available to us, the Vietoris-Rips complex (or 
simply 'Rips complex') is particularly useful, being easy to compute and having good approximation 
properties. We recall the definition. Let (X, d x ) be a metric space and a a real parameter (the 
'scale'). Then Rips(A, a) is the simplical complex on X whose simplices are the finite subsets of X 
with diameter at most a: 



When (X,dx) is a closed Riemannian manifold, J.-C. Hausmann |14j proved that if a > is 
sufficiently small then the geometric realisation of Rips(A, a) is homotopy equivalent to X. This 
result was later generalised by J. Latschev [15J, who proved that if (Y,dy) is sufficiently close 
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to (X, dx) in the Gromov-Hausdorff distance, then there exists a > such that Rips(Y, a) is 
homotopy equivalent to X. Recently, Attali et al. p] adapted these results to a class of sufficiently 
regular compact subsets of Euclidean spaces. For larger classes of compact subsets of Riemannian 
manifolds, the homology and homotopy of such sets are known to be encoded in nested pairs of 
Vietoris-Rips complexes [7], yet it remains still open whether or not a single Rips complex can 
carry this topological information. 

These approaches make it possible to recover the topology of a metric space (X, dx) from a suffi- 
ciently close approximation (Y, dy), provided that the parameter a is chosen correctly. Unfortu- 
nately, this choice very much depends on the geometry of X and can be difficult (if even possible) 
to determine in practical applications. One way round this issue is to use topological persis- 
tence [HI [16], which encodes the homology of the entire nested family Mips(X) = (Rips(X, a)) a6 R 
in a single invariant, the persistence diagram. Relevant scales a can then be selected by the user, 
and the diagram provides an explicit relationship between the choice of a scale a and the homology 
of the corresponding Vietoris-Rips complex. 

The stability of this construction was established by Chazal et al. who proved that 

d b (dgm(Mips(X)),dgm(Mips(y))) sc 2d GH (^,^), (1-1) 

for finite metric spaces (X, dx) and (Y, dy)- Here db and dcH denote the bottleneck [9j Chap. 8] and 
Gromov-Hausdorff distances, respectively. The bound turns out to be tight, which motivates the 
use of persistence diagrams as discriminative signatures to compare geometric shapes represented 
as finite metric spaces. 

In this paper we show that the same inequality holds for all precompact metric spaces, and can 
in fact be extended to a larger class of filtered geometric complexes. Our analysis adopts a new 
perspective, based on recent developments in the theory of topological persistence [6], which results 
in simple and natural proofs. Our contributions are the following: 

• Extending the concept of simplicial map between complexes to the one of e-simplicial multival- 
ued map between filtered complexes, we show that such maps induce canonical e-interleavings 
between the persistent homology modules of these complexes — Section [3j 

• Applying this result to correspondences between metric spaces, we prove the e-interleaving of 
the persistent homology modules of certain families of filtered geometric complexes (including 
Rips nitrations) built on top of e-close metric spaces — Section |4| 

• We prove the tameness of the persistent homology modules of the above filtered complexes 
when the vertex sets are precompact. Combined with the previous results, this result shows 



that inequality 1.1 can be generalised as claimed above — Section 5.1 



In addition to this consistent set of results, the second part of Section [5] presents a few noteworthy 
properties of the homology groups of Rips and Cech complexes of precompact metric spaces. 
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2 Persistence modules and persistence diagrams 



We adopt the approach and the notation of [6] . In this section we recall the definitions and results 
that we need. For a detailed presentation the reader is referred to [6]. 

A persistence module V over the real numbers R is an indexed family of vector spaces^ (V a \ a e R) 
and a doubly-indexed family of linear maps (v b : V a — ► Vf, \ a ^ b) which satisfy the composition 
law v^j o v b = v c a whenever a ^ b ^ c, and where v% is the identity map on V a . 

Example 2.1 (homology of a filtered complex). This is the standard example, which we use 
throughout this paper. Let § be a filtered simplicial complex: that is, a family (S a a e R) 
of subcomplexes of some fixed simplicial complex S, such that S a c: § b whenever a ^ b. Let 
V a = H(S a ) be the homology groupj^jof §> a , and let v b a : H(S a ) —*■ H(§5) be the linear map induced 
by the inclusion §> a <—* S&. Since, for any a ^ b ^ c, the inclusion § a § c is the composition of the 
inclusions S a and S& S c , the linear maps satisfy v c a = v%ov b and the family (H(S a ) | a e R) 

is a persistence module. 

Let U,V be persistence modules over R, and let e be any real number. A homomorphism of 
degree e is a collection $ of linear maps 

O a : U a -»■ V a+t | a e R) 

such that v b a ^ fL a <f) a = <j) b o u b a for all a ^ b. We write 

Hom e (U, V) = {homomorphisms U — » V of degree e}, 
End e (V) = {homomorphisms V —* V of degree e}. 

Composition is defined in the obvious way. 

For e ^ 0, the most important degree-e endomorphism is the shift map 

1^ e End e (V), 

which is the collection of maps (v^ +e ) from the persistence structure on V. If $ is a homomorphism 
U — > V of any degree, then by definition = ly$ for all e ^ 0. 



Example (2.1 continued). Given e > 0, if / : 8 —* S' is a simplicial map such that / maps S a 
to S' a+e for any a e R, then / induces a homomorphism of degree e between the persistence modules 
H(S) and H(S'). 

Two persistence modules U, V are said to be e-interleaved if there are maps 

$ e Horn 6 (U, V) , * e Horn 6 (V, U) 
such that = l?f and = l?r e . 



1 AU vector spaces are taken to be over an arbitrary field k, fixed throughout this paper. 
2 We use simplicial homology with coefficients in the field k. 
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Following [U [6] we say that a persistence module V is q-tame if 

r„ = rank(i^) < oo whenever a < b. 
This regularity condition ensures that persistence modules behave well: 

Theorem 2.2 ([6j, Section 2.8 and Theorem 4.21). If U is a q-tame module then it has a well- 
defined persistence diagram dgm(U). If\J,V are q-tame persistence modules that are e-interleaved 
then there exists a e-matching between the multisets dgm(U), dgm(V), i.e. the bottleneck distance 
between the diagrams is bounded: db(dgm(U), dgm(V)) ^ e. 

3 Multivalued maps 

The notion of a simplicial map between simplicial complexes extends to the notion of an e-simplicial 
map between filtered simplicial complexes in the following way: 

Definition 3.1. Let S and T be two filtered simplicial complexes with vertex sets X and Y 
respectively. A map / : X — > Y is e-simplicial from S to T if it induces a simplicial map S a — * T a+£ 
for every a e R. Equivalently, / is e-simplicial if and only if for any a e R and any simplex a e § a , 
/(<r) is a simplex of T a+£ . 

We wish to extend this concept to multivalued maps. Here are the basic notions. 

A multivalued map C : X =» Y from a set X to a set Y is a subset of X x Y, also denoted C, 
that projects surjectively onto X through the canonical projection ttx '■ X x Y —> X. The image 
C(a) of a subset a of X is the canonical projection onto Y of the preimage of a through irx- 

A (single-valued) map / from X to Y is subordinate to C if we have (x,f(x)) e C for every 
x e X. In that case we write / : X — *■ Y. 

The composite of two multivalued maps C : X Y and D : Y =» Z is the multivalued map 
D o C : X =S Z, defined by: 

(x,z) e D o C there exists j/eY such that (x, y) £ C and (y, z) e D 

The transpose of C, denoted C T ', is the image of C through the symmetry map [x, y) <— > (y,x). 
Although C T is well-defined as a subset of Y x X, it is not always a multivalued map because it 
may not project surjectively onto Y. 

We now discuss simplicial multivalued maps. 

Definition 3.2. Let S and T be two filtered simplicial complexes with vertex sets X and Y 
respectively. A multivalued map C : X Y is e-simplicial from S to T if for any a e R and any 
simplex a 6 S a , every finite subset of C{a) is a simplex of T a+£ . 

Proposition 3.3. Let C : X ^Y be an e-simplicial multivalued map from S to T. Then C induces 
a canonical linear map H(C) e Hom £ (H(S), H(T)) ; equal to H(/) for any f subordinate to C. 
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Proof. Any choice of / induces a simplicial map §> a — > T a+e at each a e R, and these maps commute 
with the inclusions S a S&, T a+t <^-* Tf, +e for all a ^ b. Thus / induces H(/) e Hom e (H(S), H(T)). 
Any two subordinate maps /i, /2 : A — > Y induce simplicial maps S a — > T a+e which are contiguous 
and therefore H(/i) = H^)- Thus the map H(C) is uniquely defined. □ 

Another immediate consequence is that the induced homomorphism is invariant under taking sub- 
sets of C that are also multivalued maps: 

Proposition 3.4. If C £ C : X z^Y and C is e-simplicial from § to T, then C is e-simplicial 
from S to T and H(C") = H(C) 

Proof. Since C" is a mutivalued map contained in C, it is also e-simplicial, and any map / : X — > Y 
that is subordinate to C is also subordinate to C, so we have H(C") = H(C). □ 

Finally, induced homomorphisms compose in the natural way: 

Proposition 3.5. Let S, T, U 6e filtered complexes with vertex sets X , Y , Z respectively. If 

C : X ^Y is a e-simplicial multivalued map from S to T, 
D : y => Z is a 5-simplicial multivalued map from T to U, 

i/ien i/ie composite D o C : X ^ Z is a (e + 5) -simplicial multivalued map from S to U, and 
H(DoC) = H(D)oH(C). 



Proof. D o C is (e + 5) -simplicial as an immediate consequence of the definition of e-simplicial 
multivalued map. Let / : X — > Y be subordinate to C, and let g : Y — > Z be subordinate to I?. 
The composite g o f : X Z is subordinate to -D o C, therefore H(D o C) = H(Z?) o H(C). □ 



4 Correspondences 



4.1 Interleaving persistence modules of filtered complexes through correspon- 
dences 



Definition 4.1. A multivalued map C : X Y is a correspondence if the canonical projection 
C — > Y is surjective, or equivalently, if C T is also a multivalued map. 



We immediately deduce, if C is a correspondence, that the identity maps tx = {(x,x) : x e X} 
and ly = {(y,y) : y e F} satisfy: 

lyCCoC T 



From this property and propositions 3.4 and 3.5, we deduce the following result: 
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Proposition 4.2. Let S, T be filtered complexes with vertex sets X, Y respectively. If C : X ^ Y 
is a correspondence such that C and C T are both e-simplicial, then together they induce a canonical 
e -interleaving between H(S) and H(T), the interleaving homomorphisms being H(C) and H(C T ). 

4.2 Applications to filtered complexes on metric spaces 

When (X,dx) and (Y,dy) are metric spaces, the distortion of a correspondence C : X Y is 
defined as follows: 

dis(C) = sup{\dx(x,x') - dy(y,y')\ : (x,y), (x',y) e C) 

The Gromov-Hausdorff distance ([3], Theorem 7.3.25) between (X, dx) and (Y, dy) is then defined 
as the infimum of the distortions among all the correspondences between X and Y: 

dcH(^) Y) = | inf{dis(C) : C is a correspondence X ^ Y} 

Although doH is not necessarily finite, it is a distance on the set of isometry classes of compact 
metric spaces: 

• it is zero if and only if the spaces are isometric; 

• a correspondence and its transpose have the same distortion, so doH is symmetric; 

• the composite of two correspondences C, C is a correspondence and dis(C o C) ^ dis(C") + 
dis(C) so dcH satisfies the triangle inequality: dcjm^X, Z) ^ dGn(X, Y) + dGu(Y,Z). 

The theme of the next few examples is that low-distortion correspondences give rise to e-simplicial 
maps on filtered complexes. 

The Cech complex 

Let (X, dx) be a metric space. For a e R we define a simplicial complex Cech(X, a) on the vertex 
set X by the following condition: 

k 

[x ,xi,...,x k ] e Cech(X,o) ^> ^\B(xi,a)=£0 

i=o 

Here B(x,a) = {x 1 e X : dx(x,x') ^ a} denotes the closed ball with centre x e X and radius a. 
Any point x in the intersection f > \ i B(xi, a) is called an a-centre for the simplex [xq, . . . , 

For a ^ 0, note that Cech(A% a) consists of the vertex set X alone. There is a natural inclusion 
Cech(X, a) Q Cech(X, b) whenever a b. Thus, the simplicial complexes Cech(X, a) together with 
these inclusion maps define a filtered simplicial complex Cech(X) on X. 

Lemma 4.3. Let (X , dx) , (Y, dy) be metric spaces. For any e > 2dcn(X,Y) the persistence 
modules H(Cech(X)) and H(Cech(Y)) are e -interleaved. 
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Proof. Let C : X Y be a correspondence with distortion at most e. 

Consider a e Cech(X, a). Let x be an a-centre for a, so dx(x,x) ^ a for all x e ex. Pick y e C(x). 
Now for any y e C(o~) we have y e C{x) for some x e a, and therefore: 

dY{y,y) ^ dx{x,x) + e ^ a + e 

Let t c C(ct) be any finite subset; then y is an (a + e)-centre for r and hence r e Cech(Y, a + e). 

We have shown that C is e-simplicial from Cech(X) to Cech(y). S yme trically, C T is e-simplicial 
from Cech(y) to Cech(X). The result now follows from Proposition 4.2 □ 



The Vietoris Rips complex 



Let (X, dx) be a metric space. For a e R we define a simplicial complex Rips(X, a) on the vertex 
set X by the following condition: 

[xo, xi, . . . , Xk] e Rips(X, a) dx(xi,Xj) ^ a for alii, j 

For a ^ 0, note that Rips(X, a) consists of the vertex set X alone. There is a natural inclusion 
Rips(X, a) ci Rips(X, 6) whenever a ^ b. Thus, the simplicial complexes Rips(X, a) together with 
these inclusion maps define a filtered simplicial complex Mips(X) on X. 

Lemma 4.4. Let (X,dx), (Y,dy) be metric spaces. For any e > 2dcH(^, y) the persistence 
modules H(IRips(X)) and H(IRips(y)) are e -interleaved. 

Proof. Let C : X =» Y be a correspondence with distortion at most e. 

If a e Rips(X, a) then dx(x,x') ^ a for all x,x' e a. Let r c: C(<r) be any finite subset. For any 
y,y' e r there exist x,x' e a such that y £ C(x), y' e C(x'), and therefore: 

(y , y) < (», < a + e 

It follows that r e Rips(Y, a + e). 

We have shown that C is e-simplicial from Mips(X) to Mips(Y). Symetrically, C T is e-simplicial 



from Mips(y) to Mips(X). The result now follows from Proposition 4.2 □ 



The witness complex 

Let L, W be two sets and A : W x L — > R + a non negative function. For a e R we define a 
simplicial complex Wit(L, W; a) with vertex set L by the following conditions: 

fVr£CT, re Wit(L,W;a) 
a e Wit(L,W;a) <s> j and 

I 3w e W s.t. A(w, Z) < A(w, I') + a for all / e cr and I' e L\a 
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Whenever the second condition is verified by a simplex a cz L and a point w e W, we say w is an 
a-witness for a. An a-witness is obviously a 6- witness for any 6 ^ a, so there is a natural inclusion 
Wit(L, W; a) <= Wit(L, VF; 6). The simplicial complexes Wit(L, W; a) together with these inclusion 
maps define a filtered simplicial complex Wit(L, W) with vertex set L. 

Let L be a set, W, W be two witness sets of L with respect to maps A : W x L —* R + and 
A' : W' xL-> R + . The distortion (with respect to L, A and A') of a correspondence C :W =» W' 
is defined by 

dis(C) = sup sup | A(w ,1) -A' (v/,l) \ 

(w,w')eC leL 

Lemma 4.5. Let L be a set, W, W be two witness sets of L with respect to maps A : W x L — > R + 
and A' : W' xL-> R + . For any e such that 

e > inf{dis(C) : C is a correspondence W W } 

the persistence modules H(Wit(L, W)) and H(Wit(L, W')) are e -interleaved. 



Proof. Let C : W =» W be a correspondence such that dis(C) < e. We claim that every simplex 
a c L with an a-witness in has an (a + e)-witness in W'. Indeed, let w e W be an a-witness 
for a, and let w 1 e W' be such that (w, w') e C. Then, for all I e a and V e L\o~, we have 
A'(w', I) ^ A(w, I) + e A(w, V) + a + e, and so w' is an (a + e)-witness for a. 

It follows from our claim that the identity map 1l is e-simplicial from Wit(L, W) to Wit(L, W). 
Symmetrically, IT = 1l is e-simplicial from Wit(L, W) to Wit(L, W), and we conclude as in 



Lemmas 4.3 and 4.4 that H(Wit(L, W)) and H(Wit(L, W')) are e-interleaved. □ 



Example 4.6. If (A, dx) is a metric space, and L, W are subsets of X, A can be chosen as the 



restriction of dx to W x L: A(u>,a;) = <ix(w,x) for any (w,x) e W x L. Then Lemma 4.5 can be 
restated in the following way. 

Lemma 4.7. Let (X,dx) be a metric space, and let L,W,W be subsets of X. For any e greater 
than the Hausdorff distance (ki{W, W) the persistence modules H(Wit(L, W)) and H(Wit(L, W')) 
are e-interleaved. 



Proof. Since e > du(W, W), C = {(w, w') e W x W' : dx(w, w') < e} is a correspondence between 
W and W . Moreover for any I e L and (w, w') e C, \dx(w, I) — dx(w' , l)\ < dx{w, w') < e. So the 
distortion of C with respect to L is upper bounded by e. □ 

Note that there is no equivalent of Lemma |4.5| in the case where the set L is perturbed, even if the 
set of witnesses is constrained to stay fixed (W = W). Here is a counterexample: on the real line, 
consider the sets W = W = L = {0, 1} and U = {—5, 0, 1, 1 + 6}, where 5 e (0, 1/2) is arbitrary. 
Then, Wit(L,W;a) = {[0], [1], [0,1]} for all a > 0, while Wit{L',W;a) = {[-5], [0], [1], [1 + 
5], [-5,0], [1,1 + 6]} for all a e [5,1- 5). Thus, H(Wit(£, W)) and R(Wit(L' ,W)) are not e- 
interleaved for any value e < 1 — 28, while dn(-^, L') = 8 can be made arbitrarily small compared 
to 1 - 28. 
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Generalisation to dissimilarity spaces 



In data analysis one often considers data sets X equipped with a dissimilarity measure, i.e. a 
map dx '■ X x X — > R that satisfies dx(x,x) < dx(x,y) = dx(y,x) for all x,y e X but is not 
required to satisfy any of the other metric space axioms. It is easily seen that Cech, Vietoris- 
Rips and witness complexes can still be defined over such a space X, and that the distortion of a 
correspondence C : X =» Y between such spaces is well-defined. Moreover, since the proofs of our 
interleaving results do not make use of any other distance axiom (triangle inequality, non- negativity, 
zero property), they remain valid for such spaces. 



5 Regularity of Rips and Cech filtrations 

5.1 Stability of persistence for Rips and Cech filtrations on precompact metric 
spaces 

Given a positive real number e > 0, a subset F c X of a metric space (X, dx) is an e-sample of X 
if for any ieX there exists / 6 F such that dx(x, f) < e. A precompact metric space (X, dx) is 
a space such that X has a finite e-sample for every e > 0. 

Proposition 5.1. If(X, dx) is a precompact metric space then the persistence modules H(Cech(X)) 
and H(IRips(A A )) are q-tame. 

Proof. Let us first consider the case of the Vietoris-Rips persistence module. We must show that 
the map l\ : H(Rips(X, a)) — > H(Rips(X, b)) induced by the inclusion has finite rank whenever 
a < b. Let e = (6 — a)/2. Since X is precompact there exists a finite e/2-sample F of X. The set 
C = {(x, f) £ X x F : dx(x, f) < e/2) is an e-corresponence so the Gromov-Hausdorff distance 



between F and X is upper bounded by e/2. It then follows from Lemma 4.4 that there exists an 
e-interleaving between H(Mips(X)) and H(IRips(i ? )). Using the interleaving maps, I* factorises as 

H(Rips(X, a)) -> H(Rips(F, a + e)) — H(Rips(X, a + 2e)) = H(Rips(X, b)). 

The second term is finite dimensional since Rips(F; a + e) is a finite simplicial complex; so I* has 
finite rank. 

The proof for the Cech persistence module is the same. □ 

The above proposition implies that the persistence diagrams of H(Cech(X)) and H(Mips(X)) are 
well-defined for precompact metric spaces. The persistence stability theorem |2.2| gives us the 
following result relating the Gromov-Hausdorff distance between two spaces to the bottleneck 
distance between the persistence diagrams of their Cech and Vietoris-Rips filtrations. 

Theorem 5.2. Let X, Y be precompact metric spaces. Then 

d b (dgm(H(Cech(X))),dgm(H(Cech(y)))) ^ 2d GH (X,y), 

d b (dgm(H(Mips(X))),dgm(H(Mips(y)))) ^ 2d GU (X,Y). 
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Remark. The second inequality of Theorem 5.2 has already been proven in [5] is the particular case 
of finite metric spaces using a different approach based upon the embedding of finite metric spaces 
into 



5.2 Dimension of homology groups of Rips and Cech nitrations on precompact 
metric spaces 

Although the persistence modules H(Cech(X)) and H(Mips(A)) are q-tame when X is a precompact 
metric space, the homology groups H(Cech(X, a)) and H(Rips(X, a)) are not necessarily finite 
dimensional even when X is compact. In this section we discuss and illustrate a few properties 
about the dimensionality of these groups. 

Homology groups of the Rips filtration 

It is easy to construct an example of a compact metric space X such that for a given a > 0, 
Hi(Rips(X, a)) has an uncountable infinite dimension. For example consider the union X of two 
parallel segments in R 2 defined by 

X = {(*, 0) e R 2 : t e [0, 1]} u {(t, a) e R 2 : t e [0, 1]} 

endowed with the restriction of the euclidean metric in R 2 . Then for any t e [0, 1], et = [(t, 0), (t, a)] 
is an edge of Rips(X, a) but there is no triangle in Rips (A, a) that contains et in its boundary. As a 
consequence, the cycles = [(0, 0), (t, 0)] + et + [(t, a), (0, a)] — eo, t e (0, 1], are not homologous to 
and are linearly independent in Hi(Rips(X, a)). Note that a is the only value for which the first 
homology group of the Vietoris-Rips complex Hi(Rips(X, .)) is not finite. However in general the 
set of values a such that the first homology group is infinite dimensional can be arbitrarily large. 

Proposition 5.3. For any a, (3 e R such that < a ^ (3 and any integer k there exists a compact 
metric space X such that for any a e [a, f3], Hfc(Rips(X, a)) has a non countable infinite dimension. 

Proof. The following example has been obtained with the help of J.-M. Droz who also proved that 
a similar example can be realised as a subset of R 4 endowed with the Euclidean metric [8] . 

Without loss of generality, we can assume that a = 1 and (3 = 2. Let us first consider the case 
k = 1. Consider the union X of the two non-parallel rectangles in R 3 defined by 

X = R 1 uR 2 

= {(t,0,z) e R 2 : t e [0,2],ze [0,1]} u {(t,l + \t,z) e R 2 : t e [0,2], ze [0,1]} cz R 3 
endowed with the restriction of the L 1 -norm in R 3 (see Figure [l]). 

Since we are using the L 1 -norm, for a e [1,2] and z e [0,1], the point (2(a — l),0,z) e R\ is 
at distance a from R 2 and its unique closest point on R 2 is (2(a — l),a, z). As a consequence 
e z = [(2(a — 1), 0, z), (2(a — 1), a, z)] is an edge of Rips(AT, a) for all z e [0, 1] but there is no (non 
degenerate) triangle in Rips (X, a) that contains e z in its boundary. As a consequence, the cycles 
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v t y ♦ 




Figure 1: The union X of the above 2 rectangles endowed with the restriction of the L metric in 
R 3 provides an example of a compact metric space such that dim Hi (Rips (X, a)) = +00 for any 
ae [1,2]. 

7 2 = [(2(o- l),0,0),(2(a- l),0,z)] + e z + [(2(a - 1), a, z), (2(a - l),a,0)] - e , ze (0,1] are not 
homologous to and are linearly independent in Hi(Rips(X, a)). 

To prove the lemma for k > 1, just consider the product of X with a (A; — l)-dimensional sphere of 
sufficiently large radius (to prevent the Rips construction from killing the {k — l)-homology), and 
apply the Kiinneth formula |13(, Theorem 3.16, p. 219]. □ 

The first homology groups of the Cech filtrations 

As for Vietoris-Rips filtrations, the homology groups of the Cech filtration of a compact set might 
be infinite dimensional as shown by [2] (Appendix B), except for the first homology groups. The 
following result was originally obtained by Smale et al. [H Theorem 8] with a different proof. 

Proposition 5.4. Let [X, d) be a precompact metric space, and let a > 0. Then, over any coefficient 
ring A and any a e R, the 1- dimensional homology Hi(Cech(X, a); A) is finitely generated over A. 
In particular, over a field k, 

dim k (Hi(Cech(X, a);k)) < +00. 

Proof. The proof of the lemma follows from a sequence of elementary remarks. 

1. Every 1-cycle in Cech(X, a) is homologous to a 1-cycle whose edges have length at most a. 

Proof: Any edge [x, x'] belonging to Cech(A, a) has an a-centre; that is, a point y which satisfies 
d(x,y) ^ a and d(x',y) ^ a. Since d(y,y) = < a, the point y is also an a-centre for the triangle 
[x, y, x'] and the edges [x, y], [y,x']. It follows that any 1-cycle 

7 = Y 1 ai[x i ,x' i \ 
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can be replaced by a homologous 1-cycle 

7 = 7 + <9^]a;|xi,y;,2-] = ([x^yi] + [j/j,x-]) 

i i 

all of whose edges [xi,yi], [yj,x£] have length at most a. □ 

2. There exists a finite set E a of edges of length at most a, with the following property: for any edge 
[x, y] of length at most a, there exists an edge [x', y'] in E a such that d(x, x') < a and d(y, y') < a. 

Proof: Since X is totally bounded, so is X x X with the product metric 

d((x,y), (x',y')) = max(d(x,x'),d(y,y')). 

Since X x X is totally bounded, so is its subspace 

[X x X] a = {(x,y) eXxX\ d(x,y) sc a} . 

Let (x[,y' 1 ), . . . , (x' N ,y' N ) be an a-sample for [X x X] a . Then 

E a = {[x' l ,y' l \...,[x N ,y' N ]} 

satisfies the required condition. □ 

3. Any 1-cycle can be written as a finite linear combination of cycles of the form 

[xi, x 2 ] + [x 2 , x 3 ] H h [x k -i, x k ] + [x fe , xi] (*) 

(k may vary). 

Proof: This is standard, but we give the proof explicitly. Certainly any 1-cycle 7 can be written as 
a finite linear combination of cycles (as above) and paths of the form 

[X1,X 2 ] + [X 2 ,X 3 ] + • • • + [x fe _i,Xjk] + [x k ,x k+1 ], xi 4= Xfc + i, 

(the trivial solution is to use paths of length 1 and no cycles). Consider the 'free' vertices in such 
a decomposition for 7: that is, vertices that occur as endpoints of the paths in the decomposition. 
We can eliminate the free vertices one by one as follows. Pick a free vertex and enumerate the 
paths which terminate there: P±, P2, ■ ■ ■ , P m - Since S7 = 0, we must have m ^ 2. We can decrease 
m strictly by concatenating P m with the appropriate multiple of P m -i or its reverse. This creates 
a new, longer path (or cycle, if the endpoints coincide) in place of P m , and rescales or annihilates 
P m -\- Eventually m = and the free vertex is eliminated. Finally, when there are no free vertices 
the decomposition involves only cycles, and we are done. □ 

4. Consider a cycle of the form 

7 = [x 1 ,x 2 ] + [x 2 ,x 3 ] H h [x fe _i,x fe ] + [x fc ,xi] 

whose edges have length at most a. Then 7 is homologous in Cech(X, a) to a cycle whose edges 
belong to E a . 
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Proof: Approximate each edge [xi, Xj+i] by an edge [x[, ?/•] e E a according to remark 2 (interpreting 
Xk+\ as x\, cyclically). We claim that 

7 = Wi, y[] + [y[,4] + K, y'2] + [2/2, 4] + • • • + [4, y^l + [i/*> 4] 

is homologous to 7 in Cech(X, a). Indeed 

7-7 = 5^] + [y-,Xi,x i+ i] + [y[, x i+ i, x' i+1 ]j 

(see figure]^]). To verify that the right-hand side of the equation belongs to Cech(X, a), note that the 
triangles [y[ and [y' i: Xi+i, have a-centres x'^, 37+1 and 37+1 respectively. □ 

Combining remarks 1, 3 and 4, we see that in Cech(X, a) any 1-cycle 7 is homologous to a 1-cycle 
involving only edges in the finite set E a . It follows that the homology Hi(Cech(A, a); A) is finitely 
generated. □ 



The open Vietoris— Rips filtration 

The above examples provided to prove that the first homology groups of the Vietoris-Rips filtration 
of a compact set X might be infinite dimensional strongly rely on the fact that the definition of 
Rips(A, a) involves a non-strict inequality, that is to say, a closed condition: [xq, 27, . . . , Xk] £ 
Rips(A, a) if and only if dx(xi, Xj) ^ a, for all i, j. 

It is thus natural to ask what happens if the strict inequality — an open condition — is used. Given 
a metric space (X, dx) and a real number a e R, the open Vietoris-Rips complex is the simplicial 
complex Rips(A, a~) with vertex set X defined by the following condition: 

[xo, xi j • • • , Xk] £ Rips(X, a~) o dx(xi, Xj) < a, for alii, j. 

The open condition gives us the following mild regularity result. 
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Proposition 5.5. For any precompact metric space X , any a > and any integer k, the space 
Hfc(Rips(X, a~)) has a countable basis. 

Proof. Any homology class H/%(Rips(X, a~)) is represented by some cycle c, a finite linear combina- 
tion of simplices of diameter strictly less than a, and therefore bounded away from a. It follows that 
the class is in the image of Hfc(Rips(X, a — — )) — > Hfc(Rips(X, a - )) for some positive integer n. Since 



Hfc(Kips(Jf)) is q-tame, by Proposition 5.1, this image is finite dimensional. Now Hfc(Rips(AT, a )) 



is the union of these finite dimensional images for n — > +oo, so it has a countable basis. □ 



We still cannot guarantee finite dimensionality. However, Proposition 5.5 suggests that we must 
proceed discretely if we are to find a counterexample. 

Proposition 5.6. For any given a > there exists a precompact metric space X such that 
Hi(Rips(X, a~)) has infinite dimension. 

Proof. Without loss of generality we assume that a = 1. We will construct a bounded subset 
X a R 2 whose open Vietoris-Rips complex Rips(X, 1~) has infinite-dimensional 1-dimensional 
homology. We will construct it as the union of two infinite sets L and R such that Rips(X, 1 — ) 
contains the complete graphs on L and R, and otherwise each vertex in L shares an edge with 
precisely one vertex in R, and vice versa. Following the same argument as the one used for the 



example before Proposition 5.3 we will deduce that Hi(Rips(A, 1 )) is infinite dimensonal. 



We define X in terms of an auxiliary function f{x) : [0, oo) — ► [0, oo), which will be identified later. 
We suppose initially that / is continuous, increasing, and positive except at /(0) = 0. Specifically, 
let 

L x = {f(x),x), R x = (1 - f(x),x) 
for < x < f^il/2). We define 

X = {L x , R x | x = e n , n » 0} 
where (e n ) is a decreasing positive sequence with limit 0. 

Clearly \L X — R x \ < 1 for all x > 0. We must arrange that \L X — R y \ ^ 1, for x, y distinct elements 
of the sequence (e n ). Suppose x > y. Then: 

\L x -R y \ 2 = {l-f{x)-f{y)f + {x-yf 
^(l-2/(x)) 2 + (x-y) 2 
>l-4f(x) + (x-y) 2 . 

Suppose we have chosen our sequence so that x > y implies x ^ 2y; for instance, by setting 
(e„) = (2-"). Then 

{x-yf>\x\ 

Now we choose f(x) = j§x 2 . Then for y32 > x > y in the sequence (2~ n ) we have 
\L X - R y \ 2 > 1 - 4/(x) + (s - y) 2 > 1 - \x 2 + \x 2 = 1 
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as required. 



It follows that if we define 

X = LvjR = {(2~ 2n - 4 , 2- n ) | n > 1} u {(1 - 2~ 2n - 4 , 2~ n ) \n>l} 

then Rips(AT, 1~) contains the complete graphs on L and R, and otherwise each vertex in L shares 
an edge with precisely one vertex in R, and vice versa. As a consequence there is no non-degenerate 
triangle in Rips (AT, l - ) that contains one of these edges connecting L to R. For each point x n = 
(2~ 2n ~ 4 ,2~ n ) e L, let y n e R be the corresponding point in R, so the edge e n = [x n ,y n ] is in 
Rips(X, 1 _ ). It follows that the cycles 7 n = e\ + [yi,y n ] — e n + [x n , x{\, n ^ 2, are not homologous 
to and are linearly independent. So dimHi(Rips(X, 1 — )) = +oo. □ 

The persistence diagram of Hi(RipspT)) when X is a path metric space 

Recall that a metric space (A, dx) is a path metric space if the distance between each pair of points 
is equal to the infimum of the lengths of the curves joining these two points]^] Every path metric 
space (X,dx) satisfies the following property (see |12j . Theorem 1.8): for any x,x' e X and any 
e > there exists a point z e X such that 

sup(dx(x, z),dx(x' z)) < ^dx(x, x') + e. 

Lemma 5.7. Let (A, dx) be a path metric space. For any b > 0, any 1- cycle in Rips(A, b) is 
homologous to a 1-cycle in Rips(A, 

Proof. Any 1-cycle 7 in Rips(A, b) can be written as a finite sum e\ H \- e n , of edges &{ = [xi, a^] 

of length at most b. For any i, there exists z% e X such that: 

dx(xi, Zi) ^dxixi^i) + e ^ |fe 
dx{x'i,Zi) sc +e sS §6 

As a consequence the triangle [sc$, x^, is contained in Rips(A, b) and 7 is homologous in Rips(A, b) 
to 

n 

7 = ^[xuZi] + [zi,x'i] 
i=l 

which is a 1-cycle in Rips (A, |&). □ 

Corollary 5.8. Let (X, dx) be a path metric space. Then the map Hi(Rips(A, a)) — » Hi(Rips(A, b)) 
is surjective, for all < a < b. 

Proof. Iterate the surjectivity of Hi(Rips(A, |6)) — ► Hi (Rips (X, b)). Eventually (|) b ^ a. □ 

From the corollary we immediately deduce (using the rectangle-measures and quiver calculus of [6 J ) 
the following result on the structure of the persistence diagram of Hi (Rips ( A")). 

3 See [12] chap.l, def 1.2 for a definition of the length of a curve in a metric space. 
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Proposition 5.9. Let (X,dx) be a precompact path metric space. Then the persistence diagram 
0/ Hi (Rips (X)) is contained in the vertical line {0} x [0, +00). 



Proof. It is enough to show that n(R) = for any rectangle which does not meet the line. Write 
R = [a, b] x [c, d], where —00 ^a<b<c<d^ +00. Either b < 0, in which case 

fi{R) = (o a — * b — . c — o d > < <. fc > = dim Hi (Rips (X, b)) = 

because there are no edges when b < 0. Or else < a, in which case 

H(R) = (o a —» b —» c —o d ) s: <o a — « 6 > = 

because Hi(Rips(X, a)) — > Hi(Rips(X, &)) is surjective. □ 

Remark. Note that Proposition |5.9| relies on the fact that in a length space any 1-dimensional 
simplex can be "subdivided" into a sum of smaller simplices, i.e. is homologous to a sum of simplices 
of strictly smaller diameter. This property does no longer hold for higher dimensional simplices. 
For example if (X, dx) is a circle of length 1, then any triple of points x, y, z such that dx{x, y) = 
d>x(y-> z ) — dx(z, x) = I spans a triangle in Rips(X, |) that is not h omologous to any finite sum of 



triangles of diameter strictly less than g. However Proposition 5.9 can be directly extended to the 
case of (5-hyperbolic metric spaces (see [TT] Chap. 2 for a definition) where one can then prove that 
the persistence diagram of H2(Mips(X)) is contained in a vertical band of the form 0(5) x [0, +00). 
The proof of this result is beyond the scope of this paper, but not too difficult; so it is left to the 
reader. 
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